Atrial Fibrillation (AF) is characterized by disorganised electrical activity in the atria and is known to be sustained by the presence of regions of fibrosis (scars) or functional cellular remodeling, both of which may lead to areas of slow conduction. Estimating the effective conductivity of the myocardium and identifying regions of abnormal propagation is therefore crucial for the effective treatment of AF. We hypothesise that the spatial distribution of tissue conductivity can be directly inferred from an array of concurrently acquired contact electrograms (EGMs). We generate a dataset of simulated cardiac action potential (AP) propagation using randomised scar distributions and a phenomenological cardiac model, and calculate contact electrograms at various positions on the field. A deep neural network, based on a modified U-net architecture, is trained to estimate the location of the scar and quantify the conductivity of the tissue with a Jaccard index of $91$%. We adapt a wavelet-based surrogate testing analysis to confirm that the inferred conductivity distribution is an accurate representation of the ground truth input to the model. We find that the root mean square error (RMSE) between the ground truth and our predictions is significantly smaller ($p=0.007$) than the RMSE between the ground truth and surrogate samples.
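The Jaccard index reported above (intersection over union between the predicted and ground-truth scar regions) can be sketched as follows; the toy masks and function name are illustrative, not the authors' code.

```python
import numpy as np

def jaccard_index(pred: np.ndarray, truth: np.ndarray) -> float:
    """Intersection-over-union between two binary masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    inter = np.logical_and(pred, truth).sum()
    union = np.logical_or(pred, truth).sum()
    return float(inter / union) if union else 1.0

# Toy 4x4 masks (1 = scar / low-conductivity region).
truth = np.array([[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]])
pred  = np.array([[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 0, 0], [0, 0, 0, 0]])
print(jaccard_index(pred, truth))  # 3 overlapping pixels / 4 total = 0.75
```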
The usage of technologically advanced devices has seen a boom in many domains, including education, automation, and healthcare, with most of the services requiring Internet connectivity. To secure a network, device identification plays a key role. In this paper, a device fingerprinting (DFP) model has been proposed that is able to distinguish between Internet of Things (IoT) and non-IoT devices, as well as uniquely identify individual devices. Four statistical features have been extracted from five consecutive device-originated packets to generate individual device fingerprints. The method has been evaluated using the Random Forest (RF) classifier and different datasets. Experimental results have shown that the proposed method achieves up to 99.8% accuracy in distinguishing between IoT and non-IoT devices and over 97.6% in classifying individual devices. These results signify that the proposed method is useful in assisting operators in making their networks more secure and robust to security breaches and unauthorized access.
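A minimal sketch of the fingerprint-then-classify pipeline described above, assuming the four statistics are simple moments of the packet sizes (the paper's exact features are not specified here) and using synthetic data in place of real traffic:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

def packet_features(sizes):
    """Four simple statistics over a window of five packet sizes;
    illustrative stand-ins for the paper's features."""
    s = np.asarray(sizes, dtype=float)
    return [s.mean(), s.std(), s.min(), s.max()]

# Toy data: two synthetic "device classes" with distinct size profiles.
X, y = [], []
for _ in range(200):
    X.append(packet_features(rng.normal(100, 5, size=5)));  y.append(0)  # IoT-like
    X.append(packet_features(rng.normal(800, 60, size=5))); y.append(1)  # non-IoT-like

clf = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)
print(clf.predict([packet_features([98, 102, 99, 101, 100])])[0])  # 0 (IoT-like)
```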
Automatic medical image classification is a very important field where the use of AI has the potential to have a real social impact. However, there are still many challenges that act as obstacles to practically effective solutions. One of those is the fact that most medical imaging datasets have a class imbalance problem. As a consequence, existing AI techniques, particularly neural-network-based deep-learning methodologies, often perform poorly in such scenarios, which makes this an interesting and active research focus. In this study, we propose a novel loss function to train neural network models to mitigate this critical issue in this important field. Through rigorous experiments on three independently collected datasets from three different medical imaging domains, we empirically show that our proposed loss function consistently performs well, with an improvement of 2%-10% in macro F1 compared to the baseline models. We hope that our work will precipitate new research toward a more generalized approach to medical image classification.
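The macro F1 metric used to report the improvement above averages per-class F1 scores without weighting by class frequency, which is why it is sensitive to minority-class performance. A minimal sketch (not the authors' evaluation code):

```python
import numpy as np

def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores: rare and common classes
    count equally, exposing poor minority-class performance."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    f1s = []
    for c in np.unique(y_true):
        tp = np.sum((y_pred == c) & (y_true == c))
        fp = np.sum((y_pred == c) & (y_true != c))
        fn = np.sum((y_pred != c) & (y_true == c))
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    return float(np.mean(f1s))

# 9 majority-class samples plus 1 minority sample the model misses:
# accuracy is 90%, but macro F1 drops below 0.5.
print(macro_f1([0] * 9 + [1], [0] * 10))  # ~0.474
```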
People constantly use language to learn about the world. Computational linguists have capitalized on this fact to build large language models (LLMs) that acquire co-occurrence-based knowledge from language corpora. LLMs achieve impressive performance on many tasks, but the robustness of their world knowledge has been questioned. Here, we ask: do LLMs acquire generalized knowledge about real-world events? Using curated sets of minimal sentence pairs (n=1215), we tested whether LLMs are more likely to generate plausible event descriptions compared to their implausible counterparts. We found that LLMs systematically distinguish possible and impossible events (The teacher bought the laptop vs. The laptop bought the teacher) but fall short of human performance when distinguishing likely and unlikely events (The nanny tutored the boy vs. The boy tutored the nanny). In follow-up analyses, we show that (i) LLM scores are driven by both plausibility and surface-level sentence features, (ii) LLMs generalize well across syntactic sentence variants (active vs passive) but less well across semantic sentence variants (synonymous sentences), (iii) some, but not all LLM deviations from ground-truth labels align with crowdsourced human judgments, and (iv) explicit event plausibility information emerges in middle LLM layers and remains high thereafter. Overall, our analyses reveal a gap in LLMs' event knowledge, highlighting their limitations as generalized knowledge bases. We conclude by speculating that the differential performance on impossible vs. unlikely events is not a temporary setback but an inherent property of LLMs, reflecting a fundamental difference between linguistic knowledge and world knowledge in intelligent systems.
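The minimal-pair protocol above can be sketched as follows: a pair counts as correct if the model scores the plausible sentence higher. The scoring function here is a toy lookup standing in for summed token log-probabilities from a pretrained LM; the scores are invented to mirror the reported failure mode on unlikely (but possible) events.

```python
# Toy stand-in scores; in practice these would come from a language model.
TOY_SCORES = {
    "The teacher bought the laptop": -12.0,
    "The laptop bought the teacher": -25.0,
    "The nanny tutored the boy": -14.0,
    "The boy tutored the nanny": -13.5,  # LM prefers the unlikely variant
}

def sentence_logprob(sentence: str) -> float:
    return TOY_SCORES[sentence]

def pair_accuracy(pairs):
    """Fraction of (plausible, implausible) pairs scored in the right order."""
    hits = sum(sentence_logprob(good) > sentence_logprob(bad) for good, bad in pairs)
    return hits / len(pairs)

pairs = [
    ("The teacher bought the laptop", "The laptop bought the teacher"),  # impossible
    ("The nanny tutored the boy", "The boy tutored the nanny"),          # unlikely
]
print(pair_accuracy(pairs))  # 0.5: catches the impossible pair, misses the unlikely one
```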
Bangla typing is mostly performed using English keyboards and can be highly erroneous due to the presence of compound and similarly pronounced letters. Spelling correction of a misspelled word requires an understanding of word typing patterns as well as the context in which the word is used. We propose a specialized BERT model, BSpell, targeted towards word correction at the sentence level. BSpell contains a trainable CNN sub-model named SemanticNet along with a specialized auxiliary loss. This allows BSpell to specialize in the highly inflected Bangla vocabulary in the presence of spelling errors. We further propose a hybrid pretraining scheme that combines word-level and character-level masking. Utilizing this pretraining scheme, BSpell achieves 91.5% accuracy on a real-life Bangla spelling correction validation set. A detailed comparison on two Bangla and one Hindi spelling correction dataset shows the superiority of the proposed BSpell over existing spell checkers.
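The hybrid masking idea can be illustrated on a toy sentence: each token is either masked whole (word-level) or has a single character masked (character-level). The rates and mask symbols below are assumptions for illustration, not the paper's exact recipe.

```python
import random

def hybrid_mask(tokens, word_p=0.3, char_p=0.3, seed=0):
    """Sketch of hybrid word/character masking: with probability word_p
    mask the whole token, else with probability char_p mask one
    character inside it, else leave it unchanged."""
    rng = random.Random(seed)
    out = []
    for tok in tokens:
        r = rng.random()
        if r < word_p:
            out.append("[MASK]")
        elif r < word_p + char_p and len(tok) > 1:
            i = rng.randrange(len(tok))
            out.append(tok[:i] + "_" + tok[i + 1:])
        else:
            out.append(tok)
    return out

print(hybrid_mask(["typing", "on", "english", "keyboards", "is", "error", "prone"]))
```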
Nature has inspired many metaheuristic algorithms, and their number has been escalating over the past few decades. Most of these algorithms attempt to mimic natural biological and physical phenomena. This study focuses on the Flower Pollination Algorithm, one of several bio-inspired algorithms. It employs specific global-pollination and local-pollination strategies, proposed for the exploration and exploitation of pollen grains in a constrained space. As a swarm metaheuristic, its strength lies in locating the neighbourhood of the optimal solution rather than in pinpointing the exact minimum. This work details modifications to the original method. The study finds that by replacing the fixed value of the switch probability with a dynamic value, across different dimensions and test functions, the results mostly improve over the original Flower Pollination Algorithm.
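A minimal sketch of the Flower Pollination Algorithm with a dynamic switch probability: the decay schedule, step-size constant, and population settings below are illustrative assumptions, not the study's tuned values.

```python
import numpy as np

def fpa(obj, dim=2, n=15, iters=200, seed=0):
    """Flower Pollination Algorithm sketch. The switch probability p
    decays linearly from 0.9 to 0.4 (an assumed dynamic schedule)."""
    rng = np.random.default_rng(seed)
    pop = rng.uniform(-5, 5, size=(n, dim))
    fit = np.array([obj(x) for x in pop])
    best = pop[fit.argmin()].copy()
    for t in range(iters):
        p = 0.9 - 0.5 * t / iters  # dynamic switch probability
        for i in range(n):
            if rng.random() < p:   # global pollination: heavy-tailed step toward best
                step = rng.standard_cauchy(dim) * 0.01
                cand = pop[i] + step * (best - pop[i])
            else:                  # local pollination: mix of two random flowers
                j, k = rng.integers(n, size=2)
                cand = pop[i] + rng.random() * (pop[j] - pop[k])
            f = obj(cand)
            if f < fit[i]:         # greedy replacement
                pop[i], fit[i] = cand, f
                if f < obj(best):
                    best = cand.copy()
    return best, obj(best)

sphere = lambda x: float(np.sum(x ** 2))
best, val = fpa(sphere)
print(val)  # close to 0 for the sphere function
```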
Data sparsity is a well-known problem for grammatical error correction (GEC). Generating synthetic training data is one widely proposed solution to this problem and has allowed models to achieve state-of-the-art (SOTA) performance in recent years. However, these methods often generate unrealistic errors, or aim to generate sentences with only one error each. We propose a learning-based two-stage method for synthetic data generation for GEC that relaxes the constraint that each sentence contain only one error. Errors are generated in accordance with sentence merit. We show that GEC models trained on our synthetically generated corpus outperform models trained on the synthetic data of prior work.
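The relaxed multi-error setting can be illustrated with a trivial corruption routine: a random number of simple edits is applied to a clean sentence. The operations and rates here are illustrative only; the paper's method learns its error model rather than sampling rule-based edits.

```python
import random

def corrupt(tokens, max_errors=3, seed=3):
    """Inject 1..max_errors simple synthetic errors (drop / swap /
    duplicate a token) into a copy of the clean sentence."""
    rng = random.Random(seed)
    toks = list(tokens)
    for _ in range(rng.randint(1, max_errors)):
        op = rng.choice(["drop", "swap", "dup"])
        i = rng.randrange(len(toks))
        if op == "drop" and len(toks) > 1:
            del toks[i]
        elif op == "swap" and i + 1 < len(toks):
            toks[i], toks[i + 1] = toks[i + 1], toks[i]
        elif op == "dup":
            toks.insert(i, toks[i])
    return toks

clean = "she walks to the store every day".split()
print(corrupt(clean))  # a corrupted copy; `clean` itself is unchanged
```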
Research shows that cardiovascular diseases (CVDs) pose a grave threat to human health. It is therefore important to have effective methods for CVD prognosis. To this end, the healthcare industry has adopted machine-learning-based intelligent solutions to alleviate the manual process of CVD prognosis. This work proposes an information fusion technique that combines key human attributes using analysis of variance (ANOVA) and domain experts' knowledge. It also introduces a new collection of CVD data samples for emerging research. Thirty-eight experiments were conducted to validate the performance of the proposed framework on four publicly available benchmark datasets as well as the dataset newly created in this work. The ablation study shows that the proposed approach can achieve a competitive mean average accuracy (MAA) of 99.2% and a mean average AUC of 97.9%.
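The ANOVA step above ranks attributes by how strongly their means differ across outcome classes. A minimal sketch on synthetic data, using scikit-learn's F-test (the data and attribute names are invented for illustration):

```python
import numpy as np
from sklearn.feature_selection import f_classif

rng = np.random.default_rng(1)
n = 200
y = rng.integers(0, 2, n)  # synthetic binary CVD outcome

# Three synthetic "attributes": one informative, two pure noise.
informative = y * 2.0 + rng.normal(0, 0.5, n)
noise1 = rng.normal(0, 1, n)
noise2 = rng.normal(0, 1, n)
X = np.column_stack([informative, noise1, noise2])

F, p = f_classif(X, y)  # one-way ANOVA F-statistic per attribute
print(F.argmax())  # 0: the informative column has by far the largest F
```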
Research plays a vital role in driving innovation within an organization. With the ever-growing speed and volume of information, it becomes increasingly challenging to draw insights, follow trends, keep up with new research, and formulate strategies. In this paper, we present a use case of how a corporate research community leverages Semantic Web technologies to induce a unified knowledge graph from structured and textual data, by integrating the various applications used by the community related to research projects, academic papers, datasets, achievements, and recognition. To make the knowledge graph more accessible to application developers, we identified a set of common patterns for exploiting the induced knowledge and exposed them as APIs. These patterns were born from user research that identified the most valuable use cases and user pain points to alleviate. We outline two distinct scenarios: recommendation and analytics for business use. We discuss these scenarios in detail and provide an empirical evaluation on entity recommendation. The methodology used and the lessons learned from this work can be applied to other organizations facing similar challenges.
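The idea of exposing a knowledge-graph access pattern as an API can be illustrated with a toy triple store; the entities, predicates, and the `related` helper below are invented for illustration, not the system's actual schema or endpoints.

```python
# Toy knowledge graph as (subject, predicate, object) triples.
TRIPLES = [
    ("projA", "produced", "paper1"),
    ("projA", "uses", "dataset1"),
    ("alice", "memberOf", "projA"),
    ("paper1", "cites", "paper2"),
]

def related(entity, predicate=None):
    """Pattern exposed as an API: all objects linked from `entity`,
    optionally filtered by predicate."""
    return [o for s, p, o in TRIPLES
            if s == entity and (predicate is None or p == predicate)]

print(related("projA"))          # ['paper1', 'dataset1']
print(related("projA", "uses"))  # ['dataset1']
```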
Fast and accurate detection of a disease can significantly help reduce the pressure on the healthcare facilities of any country and lower mortality during a pandemic. The aim of this work is to create a multimodal system using a novel machine learning framework that uses both chest X-ray (CXR) images and clinical data to predict severity in COVID-19 patients. In addition, the study presents a nomogram-based scoring technique for predicting the likelihood of death in high-risk patients. This study used 25 biomarkers and CXR images to predict risk in 930 COVID-19 patients from the first wave of COVID-19 in Italy (March-June 2020). The proposed multimodal stacking technique produced an accuracy, sensitivity, and F1 score of 89.03%, 90.44%, and 89.03%, respectively, in identifying low-risk and high-risk patients. This multimodal approach improved accuracy by 6% compared to using CXR images or clinical data alone. Finally, a nomogram scoring system using multivariate logistic regression was used to stratify the mortality risk of the high-risk patients identified in the first stage. Lactate dehydrogenase (LDH), O2 percentage, white blood cell (WBC) count, age, and C-reactive protein (CRP) were identified as useful predictors using a random forest feature selection model. A nomogram score based on these five predictor parameters and the CXR images was developed to quantify the probability of death and categorize patients into two risk groups: survived (<50%) and death (>=50%). The multimodal technique was able to predict the death probability of high-risk patients with an F1 score of 92.88%. The areas under the curve for the development and validation cohorts were 0.981 and 0.939, respectively.
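The final stratification step above fits a logistic regression on the selected predictors and splits patients at a 50% predicted death probability. A minimal sketch on synthetic data with two stand-in predictors (the coefficients and data are invented; the paper uses five biomarkers plus a CXR-based score):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(7)
n = 300
# Two synthetic standardized predictors standing in for e.g. LDH and age.
X = rng.normal(0, 1, size=(n, 2))
logit = 2.0 * X[:, 0] + 1.0 * X[:, 1]
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)  # simulated outcome

model = LogisticRegression().fit(X, y)
prob_death = model.predict_proba(X)[:, 1]
# The paper's 50% cut-off: survived (<50%) vs. death (>=50%).
risk_group = np.where(prob_death >= 0.5, "death", "survived")
print(risk_group[:5])
```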